Serveur d'exploration sur la recherche en informatique en Lorraine

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Optimizing the coverage of a speech database through a selection of representative speaker recordings

Identifieur interne : 005594 ( Main/Exploration ); précédent : 005593; suivant : 005595

Optimizing the coverage of a speech database through a selection of representative speaker recordings

Auteurs : Sacha Krstulovic [France] ; Frédéric Bimbot [France] ; Olivier Boëffard [France] ; Delphine Charlet [France] ; Dominique Fohr [France] ; Odile Mella [France]

Source :

RBID : Pascal:06-0450665

Descripteurs français

English descriptors

Abstract

In the context of the NEOLOGOS French speech database creation project,1 a general methodology was defined for the selection of representative speaker recordings. The selection aims at providing a good coverage in terms of speaker variability while limiting the number of recorded speakers. This is intended to make the resulting database both more adapted to the development of recently proposed multi-model methods and less expensive to collect. The presented methodology proposes a selection process based on the optimization of a quality criterion defined in a variety of speaker similarity modeling frameworks. The selection can be achieved with respect to a unique similarity criterion, using classical clustering methods such as hierarchical or K-medians clustering, or it can combine several speaker similarity criteria, thanks to a newly developed clustering method called focal speakers selection. In this framework, four different speaker similarity criteria are tested, and three different speaker clustering algorithms are compared. Results pertaining to the collection of the NEOLOGOS database are also discussed.


Affiliations:


Links toward previous steps (curation, corpus...)


Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en" level="a">Optimizing the coverage of a speech database through a selection of representative speaker recordings</title>
<author>
<name sortKey="Krstulovic, Sacha" sort="Krstulovic, Sacha" uniqKey="Krstulovic S" first="Sacha" last="Krstulovic">Sacha Krstulovic</name>
<affiliation wicri:level="3">
<inist:fA14 i1="01">
<s1>IRISA/METISS, Campus de Beaulieu</s1>
<s2>35042 Rennes</s2>
<s3>FRA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
</inist:fA14>
<country>France</country>
<placeName>
<region type="region" nuts="2">Région Bretagne</region>
<settlement type="city">Rennes</settlement>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Bimbot, Frederic" sort="Bimbot, Frederic" uniqKey="Bimbot F" first="Frédéric" last="Bimbot">Frédéric Bimbot</name>
<affiliation wicri:level="3">
<inist:fA14 i1="01">
<s1>IRISA/METISS, Campus de Beaulieu</s1>
<s2>35042 Rennes</s2>
<s3>FRA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
</inist:fA14>
<country>France</country>
<placeName>
<region type="region" nuts="2">Région Bretagne</region>
<settlement type="city">Rennes</settlement>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Boeffard, Olivier" sort="Boeffard, Olivier" uniqKey="Boeffard O" first="Olivier" last="Boëffard">Olivier Boëffard</name>
<affiliation wicri:level="3">
<inist:fA14 i1="02">
<s1>IRISAICORDIAL, 6 r. Kerampont, BP 80518</s1>
<s2>22305 Lannion</s2>
<s3>FRA</s3>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>France</country>
<placeName>
<region type="region" nuts="2">Région Bretagne</region>
<settlement type="city">Lannion</settlement>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Charlet, Delphine" sort="Charlet, Delphine" uniqKey="Charlet D" first="Delphine" last="Charlet">Delphine Charlet</name>
<affiliation wicri:level="3">
<inist:fA14 i1="03">
<s1>France Télécom R&D, 2 ave. Marzin</s1>
<s2>22307 Lannion</s2>
<s3>FRA</s3>
<sZ>4 aut.</sZ>
</inist:fA14>
<country>France</country>
<placeName>
<region type="region" nuts="2">Région Bretagne</region>
<settlement type="city">Lannion</settlement>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Fohr, Dominique" sort="Fohr, Dominique" uniqKey="Fohr D" first="Dominique" last="Fohr">Dominique Fohr</name>
<affiliation wicri:level="3">
<inist:fA14 i1="04">
<s1>LORIA, Campus Universitaire, BP 239</s1>
<s2>54506 Vandoeuvre</s2>
<s3>FRA</s3>
<sZ>5 aut.</sZ>
<sZ>6 aut.</sZ>
</inist:fA14>
<country>France</country>
<placeName>
<region type="region" nuts="2">Grand Est</region>
<region type="old region" nuts="2">Lorraine (région)</region>
<settlement type="city">Vandoeuvre</settlement>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Mella, Odile" sort="Mella, Odile" uniqKey="Mella O" first="Odile" last="Mella">Odile Mella</name>
<affiliation wicri:level="3">
<inist:fA14 i1="04">
<s1>LORIA, Campus Universitaire, BP 239</s1>
<s2>54506 Vandoeuvre</s2>
<s3>FRA</s3>
<sZ>5 aut.</sZ>
<sZ>6 aut.</sZ>
</inist:fA14>
<country>France</country>
<placeName>
<region type="region" nuts="2">Grand Est</region>
<region type="old region" nuts="2">Lorraine (région)</region>
<settlement type="city">Vandoeuvre</settlement>
</placeName>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">INIST</idno>
<idno type="inist">06-0450665</idno>
<date when="2006">2006</date>
<idno type="stanalyst">PASCAL 06-0450665 INIST</idno>
<idno type="RBID">Pascal:06-0450665</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000428</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000605</idno>
<idno type="wicri:Area/PascalFrancis/Checkpoint">000360</idno>
<idno type="wicri:explorRef" wicri:stream="PascalFrancis" wicri:step="Checkpoint">000360</idno>
<idno type="wicri:doubleKey">0167-6393:2006:Krstulovic S:optimizing:the:coverage</idno>
<idno type="wicri:Area/Main/Merge">005765</idno>
<idno type="wicri:Area/Main/Curation">005594</idno>
<idno type="wicri:Area/Main/Exploration">005594</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a">Optimizing the coverage of a speech database through a selection of representative speaker recordings</title>
<author>
<name sortKey="Krstulovic, Sacha" sort="Krstulovic, Sacha" uniqKey="Krstulovic S" first="Sacha" last="Krstulovic">Sacha Krstulovic</name>
<affiliation wicri:level="3">
<inist:fA14 i1="01">
<s1>IRISA/METISS, Campus de Beaulieu</s1>
<s2>35042 Rennes</s2>
<s3>FRA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
</inist:fA14>
<country>France</country>
<placeName>
<region type="region" nuts="2">Région Bretagne</region>
<settlement type="city">Rennes</settlement>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Bimbot, Frederic" sort="Bimbot, Frederic" uniqKey="Bimbot F" first="Frédéric" last="Bimbot">Frédéric Bimbot</name>
<affiliation wicri:level="3">
<inist:fA14 i1="01">
<s1>IRISA/METISS, Campus de Beaulieu</s1>
<s2>35042 Rennes</s2>
<s3>FRA</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
</inist:fA14>
<country>France</country>
<placeName>
<region type="region" nuts="2">Région Bretagne</region>
<settlement type="city">Rennes</settlement>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Boeffard, Olivier" sort="Boeffard, Olivier" uniqKey="Boeffard O" first="Olivier" last="Boëffard">Olivier Boëffard</name>
<affiliation wicri:level="3">
<inist:fA14 i1="02">
<s1>IRISAICORDIAL, 6 r. Kerampont, BP 80518</s1>
<s2>22305 Lannion</s2>
<s3>FRA</s3>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>France</country>
<placeName>
<region type="region" nuts="2">Région Bretagne</region>
<settlement type="city">Lannion</settlement>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Charlet, Delphine" sort="Charlet, Delphine" uniqKey="Charlet D" first="Delphine" last="Charlet">Delphine Charlet</name>
<affiliation wicri:level="3">
<inist:fA14 i1="03">
<s1>France Télécom R&D, 2 ave. Marzin</s1>
<s2>22307 Lannion</s2>
<s3>FRA</s3>
<sZ>4 aut.</sZ>
</inist:fA14>
<country>France</country>
<placeName>
<region type="region" nuts="2">Région Bretagne</region>
<settlement type="city">Lannion</settlement>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Fohr, Dominique" sort="Fohr, Dominique" uniqKey="Fohr D" first="Dominique" last="Fohr">Dominique Fohr</name>
<affiliation wicri:level="3">
<inist:fA14 i1="04">
<s1>LORIA, Campus Universitaire, BP 239</s1>
<s2>54506 Vandoeuvre</s2>
<s3>FRA</s3>
<sZ>5 aut.</sZ>
<sZ>6 aut.</sZ>
</inist:fA14>
<country>France</country>
<placeName>
<region type="region" nuts="2">Grand Est</region>
<region type="old region" nuts="2">Lorraine (région)</region>
<settlement type="city">Vandoeuvre</settlement>
</placeName>
</affiliation>
</author>
<author>
<name sortKey="Mella, Odile" sort="Mella, Odile" uniqKey="Mella O" first="Odile" last="Mella">Odile Mella</name>
<affiliation wicri:level="3">
<inist:fA14 i1="04">
<s1>LORIA, Campus Universitaire, BP 239</s1>
<s2>54506 Vandoeuvre</s2>
<s3>FRA</s3>
<sZ>5 aut.</sZ>
<sZ>6 aut.</sZ>
</inist:fA14>
<country>France</country>
<placeName>
<region type="region" nuts="2">Grand Est</region>
<region type="old region" nuts="2">Lorraine (région)</region>
<settlement type="city">Vandoeuvre</settlement>
</placeName>
</affiliation>
</author>
</analytic>
<series>
<title level="j" type="main">Speech communication</title>
<title level="j" type="abbreviated">Speech commun.</title>
<idno type="ISSN">0167-6393</idno>
<imprint>
<date when="2006">2006</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt>
<title level="j" type="main">Speech communication</title>
<title level="j" type="abbreviated">Speech commun.</title>
<idno type="ISSN">0167-6393</idno>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Algorithm</term>
<term>Automatic classification</term>
<term>Database</term>
<term>French</term>
<term>Modeling</term>
<term>Optimization</term>
<term>Quality criterion</term>
<term>Signal classification</term>
<term>Similarity</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr">
<term>Optimisation</term>
<term>Base donnée</term>
<term>Français</term>
<term>Critère qualité</term>
<term>Similitude</term>
<term>Modélisation</term>
<term>Classification automatique</term>
<term>Algorithme</term>
<term>Classification signal</term>
</keywords>
<keywords scheme="Wicri" type="topic" xml:lang="fr">
<term>Base de données</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">In the context of the NEOLOGOS French speech database creation project,
<sup>1</sup>
a general methodology was defined for the selection of representative speaker recordings. The selection aims at providing a good coverage in terms of speaker variability while limiting the number of recorded speakers. This is intended to make the resulting database both more adapted to the development of recently proposed multi-model methods and less expensive to collect. The presented methodology proposes a selection process based on the optimization of a quality criterion defined in a variety of speaker similarity modeling frameworks. The selection can be achieved with respect to a unique similarity criterion, using classical clustering methods such as hierarchical or K-medians clustering, or it can combine several speaker similarity criteria, thanks to a newly developed clustering method called focal speakers selection. In this framework, four different speaker similarity criteria are tested, and three different speaker clustering algorithms are compared. Results pertaining to the collection of the NEOLOGOS database are also discussed.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>France</li>
</country>
<region>
<li>Grand Est</li>
<li>Lorraine (région)</li>
<li>Région Bretagne</li>
</region>
<settlement>
<li>Lannion</li>
<li>Rennes</li>
<li>Vandoeuvre</li>
</settlement>
</list>
<tree>
<country name="France">
<region name="Région Bretagne">
<name sortKey="Krstulovic, Sacha" sort="Krstulovic, Sacha" uniqKey="Krstulovic S" first="Sacha" last="Krstulovic">Sacha Krstulovic</name>
</region>
<name sortKey="Bimbot, Frederic" sort="Bimbot, Frederic" uniqKey="Bimbot F" first="Frédéric" last="Bimbot">Frédéric Bimbot</name>
<name sortKey="Boeffard, Olivier" sort="Boeffard, Olivier" uniqKey="Boeffard O" first="Olivier" last="Boëffard">Olivier Boëffard</name>
<name sortKey="Charlet, Delphine" sort="Charlet, Delphine" uniqKey="Charlet D" first="Delphine" last="Charlet">Delphine Charlet</name>
<name sortKey="Fohr, Dominique" sort="Fohr, Dominique" uniqKey="Fohr D" first="Dominique" last="Fohr">Dominique Fohr</name>
<name sortKey="Mella, Odile" sort="Mella, Odile" uniqKey="Mella O" first="Odile" last="Mella">Odile Mella</name>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Wicri/Lorraine/explor/InforLorV4/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 005594 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 005594 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Wicri/Lorraine
   |area=    InforLorV4
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     Pascal:06-0450665
   |texte=   Optimizing the coverage of a speech database through a selection of representative speaker recordings
}}

Wicri

This area was generated with Dilib version V0.6.33.
Data generation: Mon Jun 10 21:56:28 2019. Site generation: Fri Feb 25 15:29:27 2022